Comparative analysis of criteria for filtering time series of word usage frequencies

Authors

  • Inna A. Belashova
  • Vladimir V. Bochkarev
Abstract

This paper describes a method for nonlinear wavelet thresholding of time series. The Ramachandran–Ranganathan runs test is used to assess the quality of the approximation. To minimize the objective function, genetic algorithms, one of the stochastic optimization methods, are proposed. The method is tested both on model series and on word frequency series from the Google Books Ngram data. Filtering based on the runs criterion is shown to give significantly better results than standard wavelet thresholding. The method is suitable when the quality of filtering matters more than the speed of calculation.
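
The idea of judging a wavelet filter by a runs criterion can be illustrated with a minimal sketch. It is not the authors' implementation, and several pieces are assumptions made for illustration: the classical Wald-Wolfowitz runs test stands in for the Ramachandran–Ranganathan test, a plain scan over candidate thresholds stands in for the genetic algorithm, and the selection rule (the largest threshold whose residual signs still pass the test at `z_crit`) is a hypothetical objective that the abstract does not specify. The Haar wavelet and all function names are likewise illustrative.

```python
import math

def runs_z(residuals):
    """Wald-Wolfowitz runs z-score for the signs of the residuals (zeros dropped)."""
    signs = [r > 0 for r in residuals if r != 0]
    n_pos = sum(signs)
    n_neg = len(signs) - n_pos
    n = n_pos + n_neg
    prod = 2.0 * n_pos * n_neg
    if prod <= n:  # one-sided or too short: the variance is undefined or zero
        return float("inf")
    runs = 1 + sum(a != b for a, b in zip(signs, signs[1:]))
    mean = prod / n + 1.0
    var = prod * (prod - n) / (n * n * (n - 1.0))
    return (runs - mean) / math.sqrt(var)

def haar_forward(x):
    """Full orthonormal Haar decomposition; len(x) must be a power of two."""
    s = math.sqrt(2.0)
    approx, details = list(x), []
    while len(approx) > 1:
        pairs = list(zip(approx[0::2], approx[1::2]))
        details.append([(a - b) / s for a, b in pairs])
        approx = [(a + b) / s for a, b in pairs]
    return approx, details

def haar_inverse(approx, details):
    """Invert haar_forward, rebuilding from the coarsest level upward."""
    s = math.sqrt(2.0)
    approx = list(approx)
    for level in reversed(details):
        rebuilt = []
        for a, d in zip(approx, level):
            rebuilt.extend([(a + d) / s, (a - d) / s])
        approx = rebuilt
    return approx

def soft(c, t):
    """Soft thresholding: shrink the coefficient toward zero by t."""
    return math.copysign(max(abs(c) - t, 0.0), c)

def denoise(x, t):
    """Soft-threshold every detail coefficient with one global threshold t."""
    approx, details = haar_forward(x)
    shrunk = [[soft(c, t) for c in level] for level in details]
    return haar_inverse(approx, shrunk)

def pick_threshold(x, candidates, z_crit=1.96):
    """Assumed rule: strongest smoothing whose residual signs still look random."""
    passing = []
    for t in candidates:
        residuals = [a - b for a, b in zip(x, denoise(x, t))]
        if abs(runs_z(residuals)) <= z_crit:
            passing.append(t)
    return max(passing) if passing else None
```

A large positive z-score signals residuals that alternate too regularly, and a large negative one signals long one-sided stretches, which suggests the filter has removed part of the signal along with the noise.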

Similar resources

AIOSC: Analytical Integer Word-length Optimization based on System Characteristics for Recursive Fixed-point LTI Systems

The integer word-length optimization known as range analysis (RA) of the fixed-point designs is a challenging problem in high level synthesis and optimization of linear-time-invariant (LTI) systems. The analysis has significant effects on the resource usage, accuracy and efficiency of the final implementation, as well as the optimization time. Conventional methods in recursive LTI systems suffe...

TREND-CYCLE ESTIMATION USING FUZZY TRANSFORM OF HIGHER DEGREE

In this paper, we provide theoretical justification for the application of higher degree fuzzy transform in time series analysis. Under the assumption that a time series can be additively decomposed into a trend-cycle, a seasonal component and a random noise, we demonstrate that the higher degree fuzzy transform technique can be used for the estimation of the trend-cycle, which is one of the ba...

Evaluation of SARIMA time series models in monthly streamflow estimation in Idanak hydrometry station

Prediction of hydrological variables is a highly effective tool in water resource management. One of the important tools for modeling hydrological processes is time series modeling and analysis. River flow series produced by time series models can be used in various studies, such as drought, flood, and reservoir system design, and for many other purposes. For this purpose, monthly flow dat...

Herbal plants zoning using target detection algorithms on time-series of Sentinel-2 multispectral images (Amygdalus Scoparia)

Today, medicinal plants have a special place in the economy and health of a society. Because many of these plants grow naturally, zoning them is necessary for optimal utilization. Traditional zoning solutions are inefficient because of their low accuracy and speed, so a new approach is needed. Remote sensing data have many applications in various fie...

Application of Single-Frequency Time-Space Filtering Technique for Seismic Ground Roll and Random Noise Attenuation

Time-frequency filtering is an accepted technique for attenuating noise in 2-D (time-space) and 3-D (time-space-space) reflection seismic data. The common approach is to transform each seismic signal from the 1-D time domain to a 2-D time-frequency domain, denoise it with a designed filter, and finally transform the filtered signal back to the original time domain. ...

Journal:
  • CoRR

Volume abs/1712.03512  Issue 

Pages  -

Publication date 2017